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Abstract 

We provide a new algorithm for the treatment of the noisy inversion of the radon 
transform using an appropriate thresholding technique adapted to a well chosen 
new localized basis. We establish minimax results and prove their optimality. In 
particular we prove that the procedures provided here are able to attain minimax 
bounds for any Lp loss. It is important to notice that most of the minimax bounds 
obtained here are new to our knowledge. It is also important to emphasize the 
adaptation properties of our procedures with respect to the regularity (sparsity) of 
the object to recover as well as to inhomogeneous smoothness. We also perform 
a numerical study which is of importance since we especially have to discuss the 
cubature problems and propose an averaging procedure which is mostly in the spirit 
of the cycle spinning performed for periodic signals. 

1 Introduction 

We consider the problem of inverting noisy observations of the d-dimensional Radon 
transform. Obviously the most immediate examples occur for d = 2 or 3. However no 
major differences arise from considering the general case. 

There is a considerable literature on the problem of reconstructing structures from 
their Radon transforms which is a fundamental problem in medical imaging and more 
generally in tomography. In our approach, we focus on several important points. We 
produce a procedure which is efficient from a L2 point of view, since this loss function 
mimics quite well in many situations the preferences of the human eye. On the other 
hand, we have at the same time the requirement of clearly identifying the local bumps, 
of being able to well estimate the different level sets. We also want the procedure to 
enjoy good adaptation properties. In addition, we require the procedure to be simple to 
implement. 

At the heart of such a problem there is a notable conflict between the inversion part 
which in presence of noise creates an instability reasonably handled by a Singular Value 
Decomposition (SVD) approach and the fact that the SVD basis very rarely is localized 
and capable of representing local features of images, which are especially important 
to recover. Our strategy is to follow the approach started in [lo| which utilizes the 
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construction borrowed from [20] (see also [I4]) of localized frames based on orthogonal 
polynomials on the ball, which are closely related to the Radon transform SVD basis. 

To achieve the goals presented above, and especially adaptation to different regulari- 
ties and local inhomogeneous smoothness, we add a fine tuning subsequent thresholding 



process to the estimation performed in lOj. 



This improves considerably the performances of the algorithm, both from a theoret- 
ical point of view and a numerical point of view. In effect, the new algorithm provides 
a much better spatial adaptation, as well as adaptation to the classes of regularity. We 
prove here that the bounds obtained by the procedure are minimax over a large class of 
Besov spaces and any Lp losses: we provide upper bounds for the performance of our 
algorithm as well as lower bounds for the associated minimax rate. 

It is important to notice that especially because we consider different Lp losses, we 
provide rates of convergence of new types attained by our procedure. Those rates are 
minimax since confirmed by lower bounds inequalities. 

The problem of choosing appropriate spaces of regularity on the ball reflecting the 
standard objects analyzed in tomography is a highly non trivial problem. We decided to 
consider the spaces which seems to stay the closest to our natural intuition, those which 
generalize to the ball the approximation properties by polynomials. 

The procedure gives very promising results in the simulation study. We show that 
the estimates obtained by thresholding the needlets outperform those obtained either by 
thresholding the SVD or by the linear needlet estimate proposed in [l^]. An important 
issue in the needlet scheme is the choice of the quadrature in the needlet construction. We 
discussed the possibilities proposed in the literature and considered a cubature formula 
based on the full tensorial grid on the sphere introducing an averaging which principle 
is close to the cycle-spinning method. 

Among others, one amazing result is the fact that to attain minimax rates in Lqo 
norm, we need in this case to modify the estimator, which is also corroborated by the 
numerical results: see Theorem [2] and Figure HI 

In the first section, we introduce the Radon transform and the associated SVD basis. 
The following section summaries the construction of the localized basis, the needlets. 
Section 4 introduces our procedures and states the theoretical results: upper bounds 
and lower bounds. Section 5 details the simulation study. Section 6 details important 
properties of the needlet basis. The proof of the two main results stated in section 4 are 
postponed in the two last sections. 



2 Radon transform and white noise model 
2.1 Radon transform 

Here we recall the definition and some basic facts about the Radon transform (cf. 

nil)- Denote by B'^ the unit ball in R"^, i.e. B'l = {x = (xi , . . . , Xd] G M'^ : |x| < 1} 
with |x| = (^{^1 ^i)^^^ by S*^^^ the unit sphere in R*^. The Lebesgue measure on 
B*^ will be denoted by dx and the usual surface measure on S'^^^ by da(x) (sometimes 



2 



we will also deal with the surface measure on S"^ which will be denoted by da^)- We let 
|A| denote the measure |A1 = dx if A C B'l as weh as |A| = da(x) if A c S'^-^ . 
The Radon transform of a function f is defined by 



Rf(e,s) 



sG+yeB'' 



f(se+y]dy, 9 G s € [-1,1], 



where dy is the Lebesgue measure of dimension d — 1 and G"*" = {x G M'^ : (x, 9) = 0}. 
With a slight abuse of notation, we will rewrite this integral as 



Rf(9,s) 



(-y,e>=s 



f(y)dy. 



By Fubini's theorem, we have 



Rf(9,s)ds 



f (x)dx. 



It is easy to see (cf. e.g. [17[) that the Radon transform is a bounded linear operator 
mapping h^iB'^, dx) into (S'^"^ x [-1 , 1], dM,(e, s)) , where 



d^(9,s) = da(9)- 



ds 



1 _s2Ud-l)/2- 



2.2 Noisy observation of the Radon transform 

We consider observations of the form 

dY(9, s) = Rf (9, s)d^(9, s) + £dW(9, s) 



where the unknown function f belongs to L (B", dx). The meaning of this equation is 
that for any (p(9, s) in L^(S'^^^ x [—1 , 1], d|x(0, s)) one can observe 



S'i-' x[-1,1] 



Rf(9,s)(p(9,s)d^(9,s) + £ 



(p(e,s)dW(9,s) 



(p(e,s)dY(9,s) 
= (Rf,(p)^ + £W^. 

Here W^p = J cp(0, s)dW(9, s) is a Gaussian field of zero mean and covariance 



(p(9,s)\l;(9,s)da(e) 



ds 



S'l-' x[-1,1] 



(1_32)(d-1)/2 =(^'^)^- 



The goal is to recover the unknown function f from the observation of Y. Our idea is to 
refine the algorithms proposed in [lO] using thresholding methods. In it is derived 
estimation schemes combining the stability and computability of SVD decompositions 
with the localization and multiscale structure of wavelets. To this end a frame was used 
(essentially following the construction from with elements of nearly exponential 
localization which is in addition compatible with the SVD basis of the Radon transform. 
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2.3 Singular Value Decomposition of the Radon transform 

The S VD of the Radon transform was first estabhshed in [3] ■ In this regard we also 



refer the reader to 17|, l2j] . 



2.3.1 Jacobi and Gegenbauer polynomials 

The Radon SVD bases are defined in terms of Jacobi and Gegenbauer polynomials. 
The Jacobi polynomials P^'^''^', n > 0, constitute an orthogonal basis for the space 
l],w„,|3(t)dt) with weight Wa,|3(t) = (1 - t]'*(l +t)P, a, P > -1. They are 

standardly normalized by P^^''•'^'(^) = Ct") and then 

P^^'(t)P|;:^'^'(t)w,,p(t)dt = 5n,^h^P', 

1 



where 



M) ^ 2"^+P+i r(n + (x+1)r(n+|3 + i: 
(2n + a+(3 + 1) r(n+l)r(n + a+|3 + i; 



The Gegenbauer polynomials are a particular case of Jacobi polynomials and are 
traditionally defined by 

^-^'^ = (A + l/2)/- ^>-^/^' 

where by definition (a)n = a(a + 1)...(a + n— 1) = ^^^"^j^^ (note that in the 
Gegenbauer polynomial is denoted by P^). It is readily seen that Cj^(1 ) = {^^'^^^) = 

r(n+2A) . 
TL!r{2A) '^^^^ 

^ r^ftir^ftui t^i^-THt-s h'^' with i^tA) _ 2^'^^7t r(n + 2A) 

_^C,(t)C^(t)(l-t ) 2dt-5^,^h^ with 
2.3.2 Polynomials on B'^ and S'^^ 

Let HnlM'i] be the space of all polynomials in d variables of degree < n. We denote 
by Vn[R'^) the space of all homogeneous polynomials of degree n and by Vr^(R'^) the 
space of all polynomials of degree n which are orthogonal to lower degree polynomials 
with respect to the Lebesgue measure on B*^. Of course Vo(M'^] will be the set of all 
constants. We have the following orthogonal decomposition: 



k=0 



Also, denote by ]HIrL(K'^) the subspace of all harmonic homogeneous polynomials of 
degree n (i.e. Q G EIn(M'^) if Q G Pn(M'i) and AQ = 0) and by lIn(S'^"^ ) the (injective) 
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restriction of the polynomials from ]HIrL(M'^) to S*^ ^ . It is well known that 
Nd_i(n) =dim(Mn(S'^-M) ■ 



n+ d- 1 
d- 1 



d — 1 



Let HnlS'^"') be the space of restrictions to S of polynomials of degree < n on M . 
As is well known 

n 

m=0 

(the orthogonality is with respect of the surface measure da on S*^^^). ]HIi,(S'^^^ ) is called 
the space of spherical Harmonics of degree d on the sphere S'^^^. 

Let Y;t, 1 < i < Nd-i (I), be an orthonormal basis of ]HIt(S'^^^), i.e. 



Then the natural extensions of Yt_i on B'^ are defined by Yt_i(x) = |x|^Y;_i(^) and satisfy 

-1 



Yu(rf,)Y,t'(rf,)da(ydr 



1 



Yi,i(yYt,v(£,)da(ydr = 8,,,, , ^ 
jd-i zL + d 



For more details we refer the reader to [3]. 

The spherical harmonics on S'^^^ and orthogonal polynomials on B*^ are naturally 
related to Gegenbauer polynomials. Thus the kernel of the orthogonal projector onto 
HnlS'^^M can be written as (see e.g. [211]): 



N. 1 In 



L Y^,(yY,,(e] = ^|^t^c^^(^,e)). 



(1) 



The "ridge" Gegenbauer polynomials Cn^^((x, £,)) are orthogonal to nTT^_i(B'^) in L^(B'^) 
and the kernel Lnf2C,y) of the orthogonal projector onto VnlB*^) can be written in the 
form (see e.g. la. 24 1) 



2n + d 

(n+1)d-i 
2d7^d-i 



C^/'((x,t))C^/^((y,t))da(f,) 



(2) 



d/2. 



IC 



d/2||2 



The following important identities are valid for "ridge" Gegenbauer polynomials: 

C^l/'((£„x))C^l/'((l^,x))dx = ^5^C^/"((^,Ti)), £,,11 G (3) 
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and, for X € B'i, ri € 



see e.g. [3. By (l2|) and (gD 



and again by ^ 



Wix, yU(y , y da(y = (2n + d)Mx,y). 



(4) 



2.3.3 The SVD of the Radon transform 

Assume that {Yi,^t : 1 < i. < M^— i(l)} is an orthonormal basis for EIi,(S'^^^). Then it is 
standard and easy to see that the family of polynomials 

fk,i,iW = (2k+d)^/^p{°'^+'^/^"^'(2|x|2-l)Yv,t(x), 0<l<k, k-l = 2j, 1 <i<Nd-i(l), 

form an orthonormal basis of Vi<(B'^), see e.g. 0]. Here as before ^i^iM = |x|^Yi^ |(x/jx|). 
On the other hand the collection 

gk,i,i(0,s) = [hl^^^Y'^H'i - s^)^''-'^^^ct^^[s)Y,,m, k> 0, l>0, 1 <i< Nd-i(l), 

is apparently an orthonormal basis of L^(S'^^^ x [—1, 1], d|a.(9, s)). Most importantly, the 
Radon transform R : L.^[B'^) ^ x [-1 , 1], d^(e, s)) is a one-to-one mapping and 

Rfk.i.t = '^i<9k,i,i. R*gk,i,i = ^kfk.i.i. where 



^ (k+l)(k + 2)...(k+d-1) (k+1)a-i 
More precisely, we have: For any f G L^(B'^) 

k>0 0<l<k,k-feO (mod 2) l<i<Nd-i (I) 
Furthermore, for f e L^lB'l) 

f = L^i^' L L (l^f.9k,i,i)^fk,i,i. 

k>0 0<l<k,k-l=0 (mod 2) l<i<Nd-i (I) 

In the above identities the convergence is in L^. 



For the Radon SVD we refer the reader to 171] and 2j] and 10(] 
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3 Construction of needlets on the ball 



In this section we briefly recall the construction of the needlets on the ball. This con- 
struction is due to [13] ■ Its aim is essentially to build a very well localized tight frame 
constructed using the eigenvectors of the Radon transform. For more precision we refer 

3, Q, 12] 



to 



Let {fic,i,i} be the orthonormal basis of VklB*^] defined in §2.3.31 Denote by T]^ the 
index set of this basis, i.e. = {(1,1-) :0<l<k, l = k (mod 2), < i < Nti-i(l)}. 
Then the orthogonal projector of L^(B'^) onto Vk(B'^) can be written in the form 



f(y)Lk(x,y)dy with Lk[x,y) = Y fk,i,i(x)fk,i,i(lj)- 



Using ([T]) Lk(x,,'y) can be written in the form 

=(2k+d) Y. pr+''^'"''(2M^- 

l<k,k-l=0 (mod 2) 



„|^|,p|0,W/2-.l|2u,|2_|,|^,|l 



(2k +d) 

|Sd-i| 



p(0,l+d/2-l)(2|x|2- 



l)|^,ip(o.i+d/2-i)^2|y|^-l)|y|^ 



l<k,k-l=0 (mod 2) 



X 1 + 



I 



c 



d/2-1 f/X V 
I 



d/2-1 V\|x|'iyl 
Another representation of Lk(x,y) has already be given in Clearly 



l-m(x,z)Lk(z,y)dz = 6TT^kl-m(x,y) 



and for f G h^iB'^'' 



f = _^Lkf and ||f||i = ^||Lkf||i = _^(Lkf,f). 



(5) 



(6) 



k>0 



The construction of the needlets is based on the classical Littlewood-Paley decom- 
position and a subsequent discretization. 

Let a G [0, oo) be a cut-off function such that < a < 1 , a(t) = 1 for t G [0, 1 /2] 
and supp a C [0, 1]. We next use this function to introduce a sequence of operators on 
L^(B'l). For j > write 



Ajf(x) = ^a(^)Lkf( 



k>0 



Bd 



Aj(x,y)f(y)dy with Aj[x,y) = '^ai^-^jL^(x,y). 
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Also, we define Bjf = Aj+if — Ajf. Then setting b(t) = a{t/2) — a(t) we have 
k 



Bjf(x) = }^b(^)Lkf(x) = [ Bj(x,y)f(y)dy with Bj(x,y] = }^b(|) Wx,y) 



k 



Obviously, for f G L^(B 

/A.f i\ = V n( 

.2> 



(A^f,f)=}^a(^)(Uf,f)<||f||i 
k 



An important result from [20| (see also asserts that the kernels Aj(x,y), Bj(x,y) 
have nearly exponential localization, namely, for any M > there exists a constant 
Cm > such that 

|Aj(x,y)|,|Bj(x,y)|<CM-— — -|^_=^^==, x,y G B^, (7) 

(1 +2Jd(x,y))'^Y/Wj(x)Y/Wj(y) 

where 

Wj (x) = 2-^ + ^1 - |xp, |x|^ = jxli = ^ X?, (8) 

1=1 

and 



d(x,y) = Arccos((x,y) + yl - W\J^ - |yP), (x,y) = ^xtyi. 

i=i 

Let us define 



Cj(x,y) = ^ ^a(^^)Lk(x,z)) and Dj (x, y ) = ^ Wb (^^) Lk(x, z). 

k 



Note that Cj and Dj have the same localization as the localization of Aj, Bj in ([7]) (cf. 
0). Using dnD, we get. 



Aj(x,y] 



B'l 



Ci(x,z)Cj(z,y)dz, Bj(x,y] 



Bd 



Dj(x,z)Dj(z,y)dz. (9) 



And, obviously z i— > Cj(x, z)Cj(z,y) (resp. Dj(x,z)Dj(z,y)) are polynomial of degrees 
<V+\ 

The following proposition follows from results in [2o| and [j^ ] and establishes a 
cubature formula. 

Proposition 1. Let {B(£,i, p) : i G 1} 6e a maximal family of disjoint spherical caps of 
radius p = t2^ with centers on the hemisphere S'J;. Then for sufficiently small < T < 1 
the set of points X] = {^i : i- G 1} obtained by projecting the set {£, : i G 1} on B'^ is a set 
of nodes of a cubature formula which is exact for Wji+i (B*^); for any P G n2)+2 (B'^) , 



P(u)du= Y_ 



where, moreover, the coefficients cuj £^ of this cubature are positive and satisfy cuj_£^ 
Wj(f,)2^''^, and the cardinality of the setx] is of order V'^. 
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3.0.4 Needlets 

Going back to identities Q and applying the cubature formula described in Proposi- 
tion [H we get 



Cj(x,z)Cj(z,y)dz = Y_ <^j,£.Cj(x, £,)Cj(y,£,) and 



f.6Xi 



We define the father needlets cpj £^ and the mother needlets '^j^s, by 

tPj, = ^/^J^Cj(x,£,) and i|jj_f_(x) = ^/aJ~i;Dj(x, £,) , £,GXj, i>0- 
We also set i|j-i,o = jf^ and X-i = {0}- From above it follows that 

Aj(x,y) = Y_ ^j,£,(^)(Pi,£.(y), Bj(x,y) = Y_ ^j^M'^iAv)- 

Therefore, 



Ajf(x) 



Bd 



Aj(x,y)f(y)dy = Y_ (f- = Y- '^j.t^j,^,' = (f' ^j,^,)- 



£.6Xj 



£.exj 



and 



Bjf(x) 



Bd 



Bj(x,y)f(y)dy = ^ (f,i|^j,f,)^^j,£, = ^ p5,£.^l^5,£., |3j,£, = (f,il^5,£.)- 



texj 



texi 



It is easy to prove (see [20|) that 



Ikj,dl2<i. 



From ® and the fact that ^^>o^("t2 = 1 for t E [1,oo), it readily follows that 



j>-i texj 



and taking inner product with f this leads to 



i2 = LLKf-^j,t)i'- 

J texi 



which in turn shows that the family {i^j,f,} is a tight frame for L'^(B'^). 
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4 Needlet inversion of a noisy Radon transform and min- 
imax performances 

Our estimator is based on an appropriate thresholding of a needlet expansion as follows, 
f can be decomposed using the frame above: 

Our estimation procedure will be defined by the following steps 



1 



gk,i,idY, (10) 



3j,t = L^klA.^.^ (11) 

k,l,i 

with yk,i,i = (gk,i,i.il^i,£,) 

j=-l £,exj 



with 



T = (d-1)/2. (13) 

Hence our procedure has 3 steps: the first one (|10p corresponds to the inversion of the 
operator in the SVD basis, the second one ([TT]) projects on the needlet basis, the third 
one (|12p ends up the procedure with a final thresholding. The tuning parameters of this 
estimator are 

• The range Je of resolution levels will be taken such that 

• The threshold constant k is an important tuning of our method. The theoretical 
point of view asserts that for k above a constant (for which our evaluation is 
probably not optimal) the minimax properties hold. Evaluations of k from the 
simulations points of view are also given. 

• C£ is a constant depending on the noise level. We shall see that the following choice 
is appropriate 

• Notice that the threshold function for each coefficient contains 2'"^. This is due to 
the inversion of the Radon operator, and the concentration relative to the gk,i,i's 
of the needlets. 
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• It is important to remark here that unhke the (hnear) procedures proposed in [lo| . 
this one does not require the knowledge of the regularity, while as will be seen in 
the sequel, it attains bounds which are as good as the linear ones and even better 
since handling much wider ranges for the parameters of the Besov spaces. 

We will consider the minimax properties of this estimator on the Besov bodies con- 



structed on the needlet basis. In it is proved that these spaces can also be described 
as approximation spaces, so they have a genuine meaning, and can be compared to stan- 
dard Sobolev spaces. 

We define here the Besov body B^^ as the space of functions f = IIj>_i Il£,exj ^i.t^j.t 
such that 

)T/7t 
< OO 

(with the obvious modifications for the cases 7t or r = oo) as well as B^-|.(M) the ball of 
radius M of this space. 




Theorem 1. For < r < oo, 7t > 1, 1 < p < oo there exist some constant Cp 

I _ 

■ 7t p J 



Cp(s,r,p, M), kq such that if k > kq, s > (d -|- 1 )(;^ — :^)+, in addition with if n < -p, 



71 1 



1- Ifl< " 



d+l 



sup (E||f-f||P)T' <Cp(logl/e)2 eVlogl/e 



s-(d+1)(1/7i-1/pl ^ s A s-2(1/7r-1/p) 
s + d-(d+l)/7r s+d-2/7r 



d ^ 1 / 5d-l 



2. If^,<i< 



d+A - V 4d+l 



sup ^^||, 
feB^,,.{M) 



s-2(1/7r-1/p) 



+ d-l/2' s+d-2/7t 



O Tf 5d-1 I 

J- 4d+l ^ p 



1 



s+d-1 /2 



sup {¥.\\i-%y < Cp(logl/£)2 eVlogl/e 

Remark 1. Up to logarithmic terms, the rates observed here are minimax, as will appear 
in the following theorem. It is known that in this kind of estimation, full adaptation 
yields unavoidable extra logarithmic terms. The rates of the logarithmic terms obtained 
in these theorems are, most of the time, suboptimal (for instance, for obvious reasons 
the case p = 2 yields much less logarithmic terms). A more detailed study could lead to 
optimized rates, which we decided not to include here for a sake of simplicity. 
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Figure 1: The three different minimax rate type zones are shown with respect to the 
Besov space parameters s and 7t for a fixed loss norm U' with < ^ < ^ . 



I : 




Figure 2: The three different minimax rate type zones are shown with respect to the 
Besov space parameters s and n for a fixed loss norm p with \ < ^ < ■ 
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The cumbersome comparisons of the different rates of convergence are summarized 
in Figures [D and [2] for the case < < . 

For the case of a Lqo loss function, we have a shghtly different result since the 
thresholding is here depending on the Lqo norm of the local needlet: Let us consider the 
following estimate : 

^ H H '^''^^li3j,£.lll^i,dloo>K2idc,}'^j.t 

j=-1 £.GXj 



Then for this estimate, we have the following results : 



Theorem 2. For < r < oo, n > ^ , s > 



7t 



There exist some constants Cqo = Coo(s,7t, r, M) such that if k > 4Too, where Tq 



s+d-(d+1 )/7r 



SUPj,f.2 '""i' ||l|jj,£,||oo 

-(d+1 )/7r 

sup E||foo -f||oo < Coo (eVlogl/e 

feB^_,{M) 

The following theorem states lower bounds for the minimax rates over Besov spaces 
in this model. 



Theorem 3. Let £ be the set of all estimators, for < r < oo, 7t > 1 , s > 
a) There exists some constant Cqo such , 



d+1 

7T - 



inf sup E||f* - f lloo > Coo UVlogl/e 

ieB^„ ,{M.) ^ 



s-(d+l 
s+d-(d+l )/7r 



b) For 1 < p < oo there exists some constant Cp such that if s > ^^)+, 

p ^ d+l 



1- If'^<- 



S-(d+l)(l/7I-l/p) ^ S ^ S-2(1/7I-l/p) 



inf sup (E||f* - f||{^) p > CMe s+d-(d+i)/7t 'N+d-i/2'^ s+d-2/71 

f6BJ=, ,,{M) 



2- v^<^< 



I ^ 5d-1 
p ^ 4d+l 



_^^ s-2(1/7i-1/p) 



O Tf 5d-1 < I 



inf sup (E||f*-f||]^)p > CMe^+'i-'/z'^ s+i-i/n 

f6B^,,,(M) 



inf sup (E||f*-f||]^)p > CMe ==+1-1/2 
ie'?>'„ ,[M.) 
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Figure 3: Simplified CAT device 



5 Applications to the Fan Beam tomography 
5.1 The 2D case: Fan beam tomography 

When d = 2, the Radon transform studied in this paper is the fan beam Radon transform 
used in Computed Axial Tomography scanner (CAT scan). The geometry of such a 
device is illustrated in Figure [3l An object is placed in the middle of the scanner and X 
rays are sent from a pointwise source S(0i ) making an angle 9i with a reference direction. 
Rays go through the object and the energy decay between the source and an array of 
receptor is measured. As the log decay along the ray is proportional to the integral of 
the density f of the object along the same ray, the measurements are 



Rf (61,62)= f(x)dA 

with 69 = (cos 6, sin 6) or equivalently the classical Radon transform 

Rf(6,s)= ^^g^ f(s0 + y)dy, G S\ s G [-1 , 1], 

se+yeB^ 

for 6 = 61 — 62 and s = sin 62. The ray source is then rotated to a different angle and 
the measurement process is repeated. In our Gaussian white noise model, we measure 
the continuous function Rf (0, s) through the process dY = Rf (6, s)d0-pj-^|7^ + edW(6, s), 

where the measure d6 ^^jj corresponds to the uniform measure d0id62 by the change 
of variable that maps (61,62) into (6,s). Our goal is to recover the unknown function 
f from the observation of Y using the needlet thresholding mechanism described in the 
previous sections. 
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In our implementation, we exploit the tensorial structure of the SVD basis of the 
disk in polar coordinates: 

fic,u(r, e) = (2k + 2)i/2pJ0.^)(2|r|2 - 1 )|r|Xi(0), < I < k, k - I = 2j, 1 < i < 2, 

where P-''^ is the corresponding Jacobi polynomial, and Yvj (9) = Ci,cos(l9) and Yi.^2(0) = 
Ci,sin(l0) with Cq = -4= and Ci = A= otherwise. The basis of S'^ x [—1,1] has a similar 

V27I 

tensorial structure as it is given by 

gw(e, s) = [hij-^/2(i - s^)^/^cl(s)Yv,i(e), k > o, i > o, i < i < 2, 

where is the Gegenbauer of parameter 1 and degree k. We recall that the correspond- 
ing eigenvalues are 

5.2 SVD, Needlet and cubature 

In our numerical studies, we compare four different type of estimators: linear SVD esti- 
mators, thresholded SVD estimators, linear needlet estimators and thresholded needlet 
estimators. They are defined from the measurement of the values of the Gaussian field 
on the SVD basis function Yg^ ^ . and the following linear estimates of respectively the 
SVD basis coefficients (f,fk,i,i) and the needlet coefficients (f,t^j,£,), 



gic,i,vdY and 
Pj.f, = ^b(k/2i)^ gk,i,i(yaic,i,i 



The estimators we consider are respectively defined as: 

linear SVD estimates fj"^ = ^ ^ ^k.i.if k,i,i 

k<2' l,i 

linear needlet estimates fj"'^ = ^ ^ Pj.f.^l'j.f, 

thresholded SVD estimates fj^ = ^ ^ pj^ (ctic,i,i) fk,i,i 

k<2J l,i 

thresholded needlet estimates = ^ ^ pjj ^ (Pi.t) 4'j,£, 

where Pt(0 is the hard threshold function with threshold T: 



Pt(xJ 



X if |x| > T 
otherwise 
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Observation 
SVD Dec 
Inv. Radon 

Needlet transf. 
Coeff. mod. 
Needlet inv. 
SVD rec. 



Linear SVD 



"-k.Li = lk<kn,ax'^k,l,i 



Linear SVD 



Thresh. SVD I Linear Needlet 



Thresh. Needlet 



dY = Rf de 



ds 



vr 



+ edW 



Yhk.i.i = (Y, gk,i,i) = (Rf , gk,i,i) + £k,i,i = M-k(f , fk,i,i) + ek,i,i 

ak,i,i = — {X gk,i,i) = (f>fk,i,i) + — £k,i,i 

M-k M-k 



"kli = PTk (ak,i,il 



k V ^ ^ 1,1 



a- 



k,l,i 



Thresh. SVD 



k,l,i 



|1N 



Linear Needlet 

Table 1: Algorithmic description of the considered estimators 



Thresh. Needlet 



A more precise description is given in Table [TJ In our experiments, the values of Yg^ ^ ^ 
have been obtained from an initial approximation of (f , fk,i,i) computed with a very fine 
cubature to which a Gaussian i.i.d. sequence is added. 

We have used in our numerical experiments thresholds of the form 



Ak 



and 



Tj.f. = Kaj,f,£0Ogl/£ 



where aj,f_ is the standard deviation of the noisy needlet coefficients when f = and 
£ = 1: 

k l,i 

Note that while the needlet threshold is different than in Theorem [H as cTj,£, is of order 
2^'^ its conclusions remain valid. 

An important issue in the needlet scheme is the choice of the cubature in the needlet 
construction. Proposition [1] ensures the existence of a suitable cubature f,j for every level 
j based on a cubature £,j on the sphere but does not give an explicit construction of the 
points on the sphere nor an explicit formula for the weights Wj^s,- Those ingredients are 
nevertheless central in the numerical scheme and should be specified. Three possibilities 
have been considered: a numerical cubature deduced from an almost uniform cubature 
of the half sphere available, an approximate cubature deduced from the Healpix cubature 
on the sphere and a cubature obtained by subsampling a tensorial cubature associated 
to the latitude and longitude coordinates on the sphere. The first strategy has been 
considered by Baldi et alQ] in a slightly different context, there is however a strong 
limitation on the maximum degree of the cubature available and thus this solution has 
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been abandoned. The Healpix strategy, also considered by Baldi et al. in an other 
paper [1], is easily implementable but, as it is based on an approximate cubature, fails 
to be precise enough. The last strategy relies on the subsampling on a tensorial grid 
on the sphere. While such a strategy provides a simple way to construct an admissible 
cubature, the computation of the cubature weights is becoming an issue as not closed 
form are available. 

To overcome those issues, we have considered a cubature formula based on the full 
tensorial grid appearing in the third strategy. While this cubature does not satisfy the 
condition of Proposition [H its weights can be computed explicitly and we argue that, 
using our modified threshold, we can still control the risk of the estimator. Indeed, note 
first that the modified threshold is such that the thresholding of a needlet depends only 
on its scale parameter j and on its center £, and not on the corresponding cubature weight 
tUj_£,. Assume now that we have a collection of K cubature, each satisfying conditions of 
Proposition [1] and thus defining a suitable estimate f^, the "average" cubature obtained 
by adding all the cubature points and using their average cubature weight defines a new 
estimate f satisfying: 

k=l 

By convexity, for any p > 1 , 

k=l k=1 

and thus this average estimator is as efficient as the worst estimator in the family f^^. 
We argue that the full tensorial cubature is an average of suitable cubature and thus 
that the corresponding estimator satisfies the conclusion of Theorems [1] and [2j Remark 
the proximity of this principle with the cycle-spinning method introduced by Donoho 
et al., we claim that the same kind of numerical gain are obtained with this method. 
The numerical comparison of the Healpix cubature and our tensorial cubature is largely 
in favor of our scheme. Furthermore, the tensorial structure of the cubature leads to 
some simplification in the numerical implementation of the needlet estimator so that 
this scheme is almost as fast as the simplest Healpix based one. 

5.3 Numerical results 

In this section, we compare 5 "estimators" (linear SVD with best scale, linear needlet 
with best scale, thresholded SVD with best k, thresholded needlet with best k and 
thresholded needlet with k = 3) for 7 different norms (Li, L2, L4, Lg, L7, Lg, L-\o and 
Loo ) and 7 noise levels £ {2^/^ 000 for k in 0, 1 , . . . , 6) . Each subfi gure of Figured] plots the 
logarithm of the estimation error for a specific norm against the opposite of the logarithm 
of the noise level. Remark that the subfigure overall aspect is explained by the errors 
decay when the noise level diminishes. The good theoretical behavior of the thresholded 
needlet estimator is confirmed numerically: the thresholded needlet estimator with an 
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optimized k appears as the best estimator for every norm while a fixed k yields a very 
good estimator except for the L^o case, as expected by our theoretical results. This 
results are confirmed visually by the reconstructions of Figure [5l In the needlet ones, 
errors are smaller and much more localized than in their SVD counterparts. Observe also 
how the fine structures are much more preserved with the thresholded needlet estimate 
than with any other methods. 

We conclude this paper with some sections devoted to the proofs of our results. 



6 Needlet properties 
6.1 Key inequalities 



The following inequalities are true (and proved in [16|],[15|, [19|], [I4]) and will be funda- 
mental in the sequel: In the following lines , gj^f_ will stand either for (pj^£^ or i^j.f,) 

VjGN,Vf,GXj, 0<c< ||g^,dli< 1 (14) 
VjGN,£,GXj, VxG;f, ||gj,dhlgj,tWI < C<oo (15) 

2jd/2 

lgj^(x)|<CM , : 77 (16) 

(recall that Wj(x) has been defined in ([8])). From these inequalities, one can deduce the 
following ones (see 11[): For all 1 < p < 00, 

\ Vp 

XKf,gj.t)Pll9j,din <c||f||p (17) 



}lAtgj,t(x)||p< (I) (^llAtgi.dl^ 



Vp 

(18) 



6.2 Besov embeddings 

It is a key point to clarify how the Besov bodies spaces defined above may be included 
in each others. As will be seen, the embeddings will parallel the standard embeddings 
of usual Besov spaces, but with important differences which in particular yield new 
minimax rates of convergence as detailed above. 

We begin with an evaluation of the different Lp norms of the needlets. More precisely, 
in [l3] it is shown that for < p < 00 

/ 2''^ \l/2-l/P 

||M^j,dlp~ll^j,dlp~ (^^j , texj. (19) 
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Needlet (f-^) 




SVD (f-S) 





Figure 5: Visual comparison for the original Logan Shepp phantom with e = 8/1000. 
Errors are much more localized in the needlet based estimates compared to the fully 
delocalized errors of the SVD based estimates. Fine structures are much more restored 
in the thresholded needlet estimate than in the other estimates. 
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The following inequalities are proved in [lo| 

^ ||gj,t||P < c2'('iP/2+'P/2-2'+' p_^4^ (20) 

}^ ||g^,d|^ < ciZ'^iP/^ if p^4^ (21) 
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We are now able to state the embeddings results (see 
Theorem 4. l<p<7t<oo^ B^^ C ^. 

2. 

oo>p>7t>0, s > (d+l)(1/7r-1/p), ^B^, CB;7/'^+^"^/""^/^' 
7 Proof of the upper bounds 

A important tool for the proof of the upper bounds which clarifies the thresholding 
procedure is the following lemma. 

Lemma 1. For a// j > — 1, £, G Xj; a Gaussian distribution with mean pj £_ and 

variance ^, with 

Proof of the lemma As we can write 

h,^ = Y. y^iA f + Z ^klif ^^^^ 

Rd Av 

k,l,i ■^'^ k,l,i ^ 

Here the summation is over {(k, : < k < 2', < I < k, I = k( mod 2), 1 < i < 
Nd-i(l)}- Since the Zi^_t,i's are independent N(0, 1) random variables, Zj_£^ ~ N(0,a?£^) 
we have 

= I^K/m £ £ c2.l<'-"e^ (22) 

k,l,i 

with c = (d/27t)'^^\ Here we used that {fkii} is an orthonormal basis for and hence 

^^.^^\yl^/ = \\^^A\l<^■ 

Let us now begin with the second theorem which proof is slightly simpler. 

7.1 Proof of Theorem [2] 

We have, if we denote 

Aj(f] ■.= ^Y. 
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foe - f Hoc < llfoo - Aj(f)|U + ||Aj(f) - filoo 

< llf J- r ll-fll„. 9-J(s-(d+l)/7T) 



Aj(f)||oo +Coo,^||f||B^,,2- 



We have used as B^_^ C B^^t'^^^'" , 



||Ajf-f||oo<Coo,.||f||B^„2-f(-('i+i'^'. 

Moreover 



S-(d+1 1/71 S-|d+1 



2-J(s-(d+i)/7t) < (^£^1^^^ < (^e^b^ 

1 

We have, using (fTSl) 



s-(d+l )/n+d 



as s > ^ 

7t 



Aj(f)|U < L II Z (^^^1{|0,,|||^,,|U>K2HeVi^} - P.^) ^i.^ll' 

j<J \ texi ^ ^ V J 

+ 11 L (pj.t^j.tl{|0,,|N.,dloo<K2i^eVk^}) II-) 

^ f Z (^^^P (''^^■^-P'.^lll^'.^ll-1{l0,dll^Kdloo>K2HeVI^^} 



+ sup { IPj,dll^i,d|ool||^^^^||,^^^^„^<,,,,^^g-f7^| 



We decompose the first term of the last inequahty, 

sup (l^i.t- Pj.dll^i.dlool|,^^^,|,^^^|,^>^,,,^^;j^|) 
= sup (li^i.t- Pj.dll^j,dlool||3^^^|„^^^^|,^>,,,,^^J^| 



^ (^^{lPj,£.lll^i,£.lloo>f2id£^/i^^} + ^{|(3j_tlN^j,tl|oo<f2fl£^/i^^} 
< sup (li^i,t-Pi.dll^i,dloo1|,^^^^_p^^^,||^^^^||^>.,,,,^;j^ 
+ sup (li^i,^- Pi,dll^i,dlool||p^^^|||^^^^||^^.,,,,^J^|) 
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V {l(?j,£ll|t|^i,tlloo<K2id£Vlogl/e} 



and the second one 

sup I |(3j,£.|||lj;j,f,||ool 
^ Xi V 

sup fl(3j,£.|||l|^j,f,||ool 

^ ("'{l|3j,£,lll^j,£.lloo>2K2)de^logV£} ''{||3j,£|||xPj,t||oo<K2i'ie^logl/e}) 



V {li^i,tlll^i,dloo<K2id£^logVe} 



+ sup ( |pj,f,i||l^j,£.||oo1 



{lPj,tlll^i,dloo<K2id£^log1/£} 



Now we will bound each of the four terms coming from the last two inequalities. Since 
for X ~ N (0, o^), we have 



IE(|Y|l{|Y|>Aa}) = C^ 



ye -y /^dy = e"^ < e 



In 



Noticing that the standard deviation of (|3j_£,— (3j_£^) ||i['j,f,||oo is smaller than Too2''^£ (using 
lemma [T] and p6|) ). we have: 



^2 /7t2 



if > 4t^ , where we have used Card Xj ^ c2''^. This proves that this term will be of 
the right order. 

For the second term, let us observe that we have, using theorem H] 

IPj.dll^j.dloo <Coo,n||f||Bs,,2-S(^-('i+l'/-', 

SO only the j's indexes such that j < ji will verify this inequality 



On the other side, using Pisier Lemma 



E sup (li^j.t- |3j,dll^j.dloo) < Too2''i£^21og2c2jd. 
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So 

)"e( sup f|^j,t-(3j,dll^j,dlool 



{l|3j,tl!l^j,tlloo>f2ii£^logV£} 



1 s— ( d+ 1 ) /n 

<Xo,Y_ 2'^£V21og2c2jd < Cejf " < C (||f ||b^, J^^^^^ (eVbgVl) 

This proves that this term will be of the right order. Concerning the first term of the 
second inequality, 



but 

(K2'''EVlogl/E)^ 

P(l0j,^-|3j,dll^j,dloo > Kl'^^elogl/e) <e 2(^2)(d-i)/2||„.^||^,2 < ^kV2t^ 



So 



)"E f sup f|(3j,dll^j,dlool 



{li^j,£-|3j,tl|l^i,tl|oo>K2)d£^log1/£} 



s-(d+l }/tx 



s+d-(d+l )/n 



< Coo,n||f||Bs,j£^ < C||f||B^,, (eVb^)^ 

if K'^/I > T^. This proves that this term will be of the right order. Concerning the 
second term of the second inequality, 



V {l|3i,£.l!l'^i,tiloo<2K2)d£^log1/£} 



sup ||3j,f,|||l|jj,£.||ool 

f.exi V 

<2K2''i£7b^ACoo,n||f||B|.,2-i(^-('i+i'/- 

let us again take 



2'i ~ (eVloglA 

Y_ sup (l|3j,dll^^J,dlool{|p.^^||,^. ^„^<2K2id£loglA}) 



s+d-(d+l )7i 



<C||f||B|, I £logl/£j^2fi+ Y_ 2-'(^-('i+i'/-'| <C||f||Bs,,. (e^bFlA)^ 

\ 3<)1 Jl<J<J 

This ends the proof of Theorem [2] 



s-(d+l )/7r 
+ d-(d+l )/7r 
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7.2 Proof of Theorem [T] 

As in the previous proof we begin with the same decomposition, 
||fe-f||^<2^-i||f.-Aj(f)||^ + ||Aj(f)-f||^ 

But 

||Aj(f)-f||^<C||C.^^2-J^T' if n>v 

and 

||Aj(f)-f||P< C||f||^^^2-J(^-('i+i'fi/"-i/P"P .f 
if we use the fact that for 7t > p, B^,. C Bp^^^, and for tt < p, B^,. C Bp^*^^^"^^"^^^^^'. 

sp sp sp 

We have 2-^*^ < (elogl/e)^ < (e^logl/e)^ < (e^og 1/e) ^+"^-7 . Obviously, 
this term has the right rate for tt > p. For tt < p, 

Tf M^IWl/ 1/U (s-|d+1)(V^-1/pll (s-(d+l)(1M-1/p)) 

2-J(s-(d+l)(V"-1/p)) < (£logl/£) d-^ <(£log1/£) s + d-(d+1)/^ ^ 

thanks to s > (d+1 )/tt— 1 /2. This gives the right rate for dp > d+1 . For dp < d+1 , we 
have (again as s > (d+l)/TT-l/2 ), s-(d+l](^-^) > d-^, so: l-^'^-t'i+litV^-Vp)) < 

|s-ld+1)(1/7r-1/p)) s 



"T" 



(elogl/e) "^"7 < (e log 1/e] "+'^"7 . Finally this proves that the bias term 

above always has the right rate. 

Let us now investigate the stochastic term: 



j<J £,exi ^ ^ ' ^ 



But 



<C( }^(li^j,t-Pi,d^||xl;3,dlp 



{li^j.£,l>K2i^£^log1/£} 



+ XlPj,d^||^l^j,dlP, 



||0j,t<K2)^£Vl°S V4 
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In turn, 
and 

^ (^{l|3j,tl>2K2)d£yiog1/£} + ^{|Pj,tl<K2ide^logl/e}) 

We now have the following bounds: Using Pisier lemma 
Hence, 



j<J f,exi 



IP 
Ip 

j<J texi 



< C^/b^^^^P (^Ky^]^iT7iy ' £-V2^2^PV2j(dp/2+(p/2-2) + 



if K > y/2p is large enough (we have used (fT9 
Using the forthcoming inequality ([2^ 



-q 
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Hence, 



-p/K\/ r T—\P-^ 



Hence this term is of the right rate. Let us now turn to: 



but as the standard deviation of (3j_f_ — (3j_£^ is smaller than £2'''^^^'/^ 



So 



|li^i,t-pj,£l>K2i^£7logVej 
j<J ^£,exj ^ ' 



if is large enough (where we have used that B^^ C B*'^ C L^^). Hence, this term also 
is of the right order. 

Let us turn now to the last one: using {^^i 



^6x 

Hence, 



p-q 



This proves that all the terms have the proper rate. It remains now to state and prove 
the following lemma. 
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Lemma 2. Let A = {(s,7t), s > (d + - :1) n (s > 0)}, and f G B^,., 1 < tt < 
oo, 1 <P < oo. f = ^.£^g^. |3j,£,\l;j,£,, |3j,£, = (f,ilJj,£,). Suppose that ^^^^.{\^^^E,\H^,d 



where q < p is as follows: 

1. p — q = 3_)_^^^-|/2 ('I ~ ) following domain I: 

{(s,7r), (s(1/p - 1/4) > (d - 1/2)(1/7r- 1/p)) n A}. 

Moreover we have the following slight modification at the frontier: the domain 
becomes 

{(s,7T), (s(1/p - 1/4) = (d- V2)(l/7t- Vp)) n A} 
and the inequality 



^- P - q = ''"^s+d-l^"'' = i^^) ^he following domain II: 

{(s,7r) (s > dp(l/7r- 1/p]) n (s(l/p - 1/4) < (d - l/2)(l/7r - 1/p)) n A}. 

5- P - q = '^'IwY^iW^^ = sitlttlyJ ihe following domain III: 

{(s,7t), (dp(l-l) >s)nA, for 1<-A,-,}. 

7t p p d + 1 

This lemma is to be used essentially through the following corollary: 

Corollary 1. Respectively in the domains I, II, III, we have, for q described in the 
lemma, and f G B^^^ 



Z V M>x\ (2'"ll^ullp)" < Cp^A-'' (23) 



TO^/i an obvious modification for 

{(s,7t), (s(l/p - 1/4) = (d - l/2)(l/7t- 1/p)) n A} 
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Proof of the corollary let us recall that on a measure space (X, \x] we have, if h, € Lq(|j,) 

vWHl 



then |j,(|h| > A) < ^^j^ and, as q < p, 



|h|Pd^ < 



|H|<A 



(|h|AA)Pd^ 



px^^Vd^l > x)dx < 







p - q 



□ 



For the corollary we take X = Xj, M-(^) = (2''^||4'j,£.||p)^ and h(£,) — 
Proof of lemma [2] 

Let us fix q (chosen later) and investigate separately the two cases q > 7t and q < 7t. 
For q > 7t, uie /jaue using (|19p 



Pi, 



2)-v 



/ 7id \P/2-l 



\texi V 



\t6Xj 



£.GXj 

25d ^P/2-l^^"/''^ 



q/7T 



2jd \ (p/2-i)7T/q 



q/7T 



.£.exj 



^"/2-1 / 2jd N f(p/2-1)-(7T/2-1) 
q/7t 

2j(d+l){E2^+q{i-i) 



q/7t 



7T/2-1 



Choosing q such that (sq + d(p - q) + (d + 1 )q(^ - ^)) = gives q = s+d-(d+i]7f ^ence 

— ™-T^p; q - 7t = -Tt ^T^^'Y.TZiYj'' - Thus in domain III: 
s+d— (d+l J7t ^' ^ s+d— (d+lj7T 



p - q 



s-(d+i)(V7t-Vp)^. 



{^l < ^) ^ - (d + Dd/Tt- 1/p) > 0) n (s-pd(i/7t- 1/p) < 0)} 



we have 0<q<p, 7r<q, 
For q < n, we have using ([19]) 



texj 



Pi, 



2)-v 



texi 



2jd 



p/2-1 
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/ -lid \ (7t/2-1)q/7T , -jjj ^ (p/2-l)-(7T/2-l)q/7T 



((p/2-l)-(7r/2-l)q/7T)^ 



-'i£.-(w^)-") (£(4)- 

(\ q/7t / \ l-q/7t 



q/7t / \^ 1— q/7t 

"(p-q) 
q) 

(p-q) 



Now let us investigate separately the cases "[^^^^^ <>>> and = 4. 
Case ^^^^ < 4 Using i^^, ^ and we have 

Ij(f,q,p) < C2'^(p-q)pq2-'«^2''i(T'-^'/2 < Cp?. 

if we put -sq + (p - q)(d - 1/2) = 0, i.e. q = g^^, then p - q = > 0. 

So Tt- q = > ^ ^ > (d- 1/2)(1/7t- 1/p). 

And ^ = 3_(d-v2)p"(i/n-vp) < 4 ^ s(l/p - 1/4) >(d - l/2)(l/7t - 1/p). Hence, 
we only need to impose s(1/p — 1/4) > (d — 1 /2)(1 /tt — 1 /p), describing domain I 

{(s-(d + 1)(l/7r-l/p) >0)n(s>0)}n{s(1/p-l/4) >(d-l/2)(l/7r-l/p)} 

p|d-l/2) 

on which Ij(f,q,p) < Cp/+'"'^'. 
Case ^tal > 4 

(Tt-q) 

Using ([H]) , ([20]) and ([21]), we have 

Ij(f,q,p) < C2^Mv-c)p^2-^sc2H{v-c,)/22n^-2^)_ 

If we put (p - q ) d - sq - 2^ = ^ q = we have p - q = ^'f|Y-2/n^' P > 

0^s-2(l/7r-l/p) >0 and7t-q = '~t+d-2/n^^' ^ > ^ s - dp(l/7t - 1/p) > 0. 
Moreover = > 4 ^ s(l/p - 1/4) < (d - l/2)(l/7t - 1/p). Hence, 

on the domain 

{(0 < s)n(s > (d+l)(l/7t-l/p)}n(s > dp(l/7r-l/p))n(s(l/p-l/4) < (d-l/2)(l/7t-l/p))}, 

pd-2 

we have Ij(f,q,q) < Cp-^"-^^^ . 
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Case = 4 Using ([HD, ([20]) and (HI]), we have 

if (p — q](d — 1 /2) — sq=0<^q = V-^^^yJI' '^'^^^ realized either if p = 4 = 7t and 

for s > 0. or if p / 4, 7r / 4 and < q = Tt^E^ = P s+A-yT Moreover q < 7t and q < p 

^4<p<7t; or 4<7t<pand^tol=4^s(Vp-1/4) = (d-l/2)(1/7T-Vp)}. 
Hence on the domain 

{(s,7t), (s(1/p-l/4) = (d-l/2)(l/7t-1/p))n(s>0)n(s> (d+l)(l/7r-l/p))} 

P(d-1/2) ^ 

we have Ij(f,q,p) < Cpj'+'*"'^^ 



8 Proof of the lower bounds 

In this section we prove the lower bounds, i.e. for < s < oo, 1 < 7t < oo, < r < 
oo, < M < oo and B^^(M) the ball of radius M of the space B^^, and £ is the set 
of all estimators we consider 



cup(s,7t,r, M, e) = inf sup E||f* — f||^ 

fGB^ ,,(M) 

cUoo(s,7t,r, M, e) = inf sup E||f* — f||oo. 

The main tool will be the classical lemma introduced by Fano in 1952 [8]. We will use 
the version of Fano's lemma introduced in 0]. Let us recall that K(P, Q) denotes the 
Kullback information 'distance' between P and Q. 

Lemma 3. Let A he a sigma algebra on the space O, and At G i G {0, 1 , . . . , m} 
such that Vi / j, H Aj = 0, Pt i G {0, 1 . . . ,m} 6e m + 1 probability measures on 
[CI, A). Then if 

p:=supP,(A[), k:= inf ^yK{?,,Pj), 
i=o je{o,i,...,m}m f— 

1-7^] 



p > ^ A (C\/mexp(— k)) , C = exp 



(25) 



This inequality will be used, in the following way: Let Hf- be the Hilbert space of 
measurable functions on Z = S'^^^ x [—1 , 1] with the scalar product 

r ds 



^cp(e,s)tl;(e,s)da(e)-_^^^,^_^,/. 
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It is well known that there exists a (unique) probability on [CI, A] : Qf , which density 
with respect to P is 

^=exp(Wnf)-^||f||^Hj- 

Let us now choose fo)fi) • • • )fm in B^^(M) such that i 7^ j =^ l|fi~fjllp ^ ^ and denote 
P| = Q f^ . Let f* be an arbitrary estimator of f. obviously the sets A^^ = (||f* — f 11 < |) 

are disjoint sets and we have, for 1 7^ j, 

1 

Now 

cUp(s,7T,q,M,£) > inf sup E||f*-ft||]^ > f inf sup P f ||f* - fi||p > 



'<(Pi,Pi)-,,2 



|R(ft-fj)|2d^. 



Likewise, 



cUoo(s,7r,q,M,£) > ( ^ ) inf sup P ( ||f* - ft||oo > ^ 



Using the previous Fano's lemma 



sup P ( ||f* - filip > ^ ) > ^ A (CV^Kexp(-K)) 

ft,i=0,l..m V ^ 



with 



K = inf — y 
j=o,...M m ^— ze^ 



|R(f,-fj)|^d^. 
z 



So if for a given e we can find fo)fi) •••fm in B^'^(M) such that \ ^ ] =^ ||fi — fj||p > 
6(e) and Cv^exp(— k) > 1/2 then we have 

forp<cxD, cup(s,7r, q, M, e) > ■^5(£)^, and cUqo (s, tt, q, M, e] >'^b[i] 

In the sequel we will choose, as usual, sets of functions containing either 2 items 
(sparse case) or a number of order 2'*^ or 2'''^^^' (dense cases). We will consider sets 
of functions which are basically linear combinations of needlets at a fixed level f = 
Y-Ej^x Pj.t^i.t-- Because the needlets have different order of norms depending whether 
they are around the north pole or closer to the equator, we will have to investigate 
different cases. These differences will precisely yield the different minimax rates. 



8.1 Reverse inequality 

Because the needlets are not forming an orthonormal system, we cannot pretend that 
inequality (jlSp is an equivalence. Since precisely in the lower bounds evaluations we 
need to bound both sides of Lp norm for terms of the form Y-Ej^a- ^V^'),^ with Aj C 
the following subsection is devoted to this problem. 
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1 

c 



Proposition 2. Let us Aj C Xj- Then: 

\ i/p / \ 1/p 

Y_ I( L ^tM^i.^ I < II L ^t^i.tllp < C I }1 |Ad^||il^j,d|P 

^£,'eAj £,eAj / £.6Aj \teAj 

Proof 

Let f = L^gA. A^il;^,^. Clearly, by ([18]), 

Vp 

}1 A^xl;^,d|p < C I }^ Wm)A\l 
teAj \£.eAj 

and by ^ 

\ i/P 

L ^t^j.t -^i.t'>i^ii-^i,£'ii? I < cii Y. ^t^j.tii 

,£.'exj £,6Aj / f,eAj 

so obviously 

/ \ Vp / \ Vp 

H L KL ^tM^).^ ,^Lt'>l^ll^i.t'll^ < II L ^^^i.^llp ^^[l. I^d^ll^j.dl^ 

\f,'eAj £,GAj / £.eAj \£.eAj 

□ 

In the sequel we will look for subset Aj with the following property: There exists 

0<Dj, such that V£,GAj, ||i|jj^f^||p ~ Dj. 

(Here and in all this section, a^^^ - bj will mean that there exists two absolute constants 
Ci and C2 -which will not be precised for a sake of simplicity- such that Cibj < aj_£^ < 
C2bj, for all considered £,.) As precised above, Dj may have different forms depending 
on the regions: using (jl9p . we have 



y/^-vp 



^j.tiip 



(2-j + 



For this purpose, let us precise Proposition [T] by choosing the cubature points in the 
following way: we choose in the hemisphere S'J: strips Si< = B(A, (2k + 1 )r\) \ B(A,2kri) 
with r| ~ 1 ; k G {0, ... 2' — 1} (A is the north pole). In each of these strips, we choose 
a maximal ri-net, of points £, which cardinal is of order k*^^^. Projecting these points on 
the ball we obtain cubature points £, on the ball with coefficients a)j_£^ ~ 2^''^Wj(£,). As 
a consequence, we have in the set {x G M*^, |x| < -^j, about 2'*^ points of cubature for 
which 

D3HIM^j,dlp-2'^(V2-VP). 
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And in the corona {(1 —2 < |x| < 1}, we have about 2''*^ points of cubature for 
which 

D^~||il;j,t||p~2^('i+i'(i/2-i/p)_ 

Now let us consider a set Aj of cubature points included in one of the two sets 
considered just above (either {x e M*^, |x| < or {(1 — 2^^' < |x| < 1)), and consider 
the matrix (parametrized by Aj) 

M(Aj) = ((Tl;j,,,xl;j,,,)),,,,,A,xAi 
we have for any A G Ip(Aj), using Proposition [2] 

l|M(Aj)(A)|K^{A,)<C'||A|Kp{Aj) 
On the other way, let us observe that using p4|) 

so 

M(Aj) = Diag(M(Aj)) +M'(Aj) = Diag(M(Aj))(Id + [Diag(M(Aj)]-"'M'(Aj)) 

where Diag(M(Aj)) is the diagonal matrix parametrized by Aj extracted from M(Aj). 
Clearly, each terms of [Diag(M(Aj)]^^ is bounded by c^^ 
So if ||[Diag(M(Aj)]-iM'(Aj)||£(;p(A,)) < a< 1, we have 

||M(Aj)-l£(,^(A,))<c-i ^ 



1 - a 

Let us prove that we can chose Aj large enough and such that such an a exists. By 
Schur lemma 



||[Diag(M(Aj)]-%'(Aj]||£(;^(A,))<c-^ sup ^ KM^j.t-^j.t' 
Now using (fT6]) . 



Bd y/W^ (1 + 2id(x, (1 + 2id(x, £,mm 



dx 



By triangular inequality 



So 



■1 

r'l-^dr 





1 

(1 +2idr£„£,'))'^ 



VM, \{^\,^^^,^\>^^^,)\<CUTTT-^TJT^T7^ (26) 
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Now, let us choose Aj as a maximal Kr| net in the set Xj ^ ^ M ^ (case 
1) or as a maximal Kr] net in the set Xj H {(1 — 2^^' < < 1} (case 2). Recall that 
ri ~ 22I+1 ^^^d K will be chosen later. 

As, in case 1, 

Card{f,', d(^',f,)~Kl2-'}<(Kl)'i 

£,'5^£„£,'6Aj 1=1 y -r I V J 

if M. — d > 2, and K is large enough. In case 2, again 

Card{f,', d(t', £,) ~ K12-'} < (KD'^-^ 



21 2i 



1 ^ 1 . 2a 



£,'^£„£,'eAj 1=1 I -r J I, J 

if M. — d > 1 , and K is large enough. 

Hence M(Aj) is invertible in both cases and we have: 



\ Vp / \ 1/p 



\ Vp / \ Vp 

L lAd^lltl^i.dl? < II Z ^^^^.^IIp ^ L lAd^lltl^i.dl? 
,£,eAj / £,eAj \£,eAj 

8.2 Lower bounds associated speirse/dense cases and different choices 
of Aj 

Let j be fixed, and choose 

f,eAj 

We have 

2i-i<lc<2i + i 

where is the orthogonal projector on Vic(B'^). So 

||R(f)||2 = (rR(f),f)= Y. (AkPic(f),f) 

2i-i<k<2) + i 



< ( sup A^) ^ ||P^(f)||2 < c2-^(<i-i) ^ ^|2 
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8.2.1 Sparse choice, case 1 

Let fi = Y£i'4'j,£,i) € {1,2}, £i is +1 or —1 , in such a way that 

l|fi -fillp = \\r^j,£,, -y^hB,2\\v=yHh£,^ -'^j,£,2\\v-yiHhe,, II + l|i^i,f,2llp) 

In easel, \\^\>^^E,\\r - 2^^^^^^'^^""^ ■ So 

6 = ||fi - f2||p ~y2''^(^/^-"'/T'^ ~ 2-j(s+'i(V2-i/7r)-d(l/2-i/p) = 2-jt*-'i'^/''-^/T'^ 

On the other hand 

KfPi P2) = l_L2-'('i-l^Y2_ 1 _L2-i(d-l]2-2j(s+d(i/2-l/")} ^ 1 _L2-2j(s+d-l/2-d 

Now by Fano inequahty if j is chosen so that e ~ 2"''^+'^"^/^"'^/^^ 
(under the constraint s > d{^/n— (1 — 1/2d))) 



E||f* - ft||P > P(||f* - fillp > 6/2) > c 



So necessarily 

s-d(l/7i-1/p) 

Remark 2. // 

d[^/n-{^-^))<s< d(l/7t-l/p) 



so necessarily < 1 ~ then 

lim u)Tj(s,7t, q, M, e) > C > 

8.2.2 Sparse choice, case 2 

In case 2, ||iljj,£,||T ~ 2i('i+^5(i/2-Vr)^ 

g B^^(1) <^ Y2'''^+^^f^/^~''/^^ ~2~'* <^ ^ _2-'(^+('i+i)fi/2-''/^)' 
5 = ^^2'('^+^''^/^-''/T'' ~2-''^+t'^+^5f^/^-"'/''^-f'^+^''^/^-''/T'' =2-'^*-^ 

On the other hand 

K(Pi,P2) = -^2^''^"'' V = lJ^2^''^"'"2"^^''+''^+^^^^/^"^/"'" l_L2-2j(s+d-(d+i]/7t) 

Now by Fano inequality if e ~ 2-'(*+'i-''i+^'/^' 
(under the constraint s > (d + — -^)) 



0y Ellf-^ - UWl > Pdlf-' - fillp > 6/2) > c 



So necessarily 

(s-(d+1)(1/7r-1/p))p 

IE||f* - fill? > c6^ ~ e s+d-(d+i)/,t 
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Remark 3. // 

(d+1)(l/7T-^)<S<(d+1)(l/7t-1/p) 



(so necessarily ^ < -g^J, 



lim tUp(s,7t, q, M, e) > C > 



8.2.3 Dense choice, case 1 

In this case we take now 

fp = y X ££.^j,£,. ££. = ±1 , P = (e£,)£,eAj 
As we are in case 1, we have 

£,eAj £,eAj 

Using Varshamov-Gilbert theorem, we consider a subset A of {— 1,+1}'^' such that 
Card(^) ~ 2ic^'"'^(^i) and for p ^ p', p,p' G A, ||p- p'||i > ^Card(Aj). Let us 
now restrict our set to 

fp = T X = ' P = (££.)£.eAj , P e A 

£,eAj 

So we choose 
Moreover 

1/p 

6 = l|fp-fp'llp=Tll X(^^-4)^^.^llp~^l Z l^^-4i^ll^j>^ll?' 

£,eAj \£,eAj 
_ ^2''i(i/2-Vp)||p _ p'll^/T' 2-'(^+'i/^'2''^/^ = 2~'^ 
Let us compute the Kullback distance: 
K(Pp,Pp,) = _L2H(d-l)||f^_f^,||2^ _l_2H(d-l)2-2Ks+d/2]2jd^ _l_2-2Ks+d/2-V2^ 

so by Fano inequahty 

II IIP ^ i/2Ac28^' e ^ > 1/2 
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if 

So 

inf E||f-f||P > ce^+d-'/z 
8.2.4 Dense choice, case 2 

Similarly to the previous case, we take now (with a slight abuse of notation since the 
subset A obtained using Varshamov-Gilbert theorem is not the same A, as Aj has also 
changed) 

fp = Y X ^^.^j.f.' = ±1 ' P = (£f,)£,6Aj , P&A. 
As we are in case 2, we have 

Til X e^^j,di;-T'2^('i+i'(V2-i) ^ |e^r.y-2i('i+i'(-/2-i)Card(A0~Y'2^K'i+i'^/2-2] 
£,eAj £,eAj 

So we choose 



W^^AZ^ .y(2i('i-i)2'('i+i)(-/^-i^)^'^^T2- 



y^2-'(^+^-i5. 

Moreover 

1/p 

6 = ||f p - f p' lip = Til Y. - 4)^).tllp ~ T I Z - 4l^ll^j,^llp ' 

£,6Aj \£,eAj 

^^2^^d+^)(^/2-^/v)^^p_ pi^^yv ^y2^{d+^)(^/2-^/v)2iid-^)^ 
^ 2-'(^+^-i^2''^~t^ = 2-5(s-2(V7t-i/p)) 
Let us compute the Kullback distance: 

K(Pp,PpO = 2^2^(^-i^||fp-fp,||i~ = ^2-2'(^+^-i: 

so by Fano inequality 

^ll^;/llg>l/2Ac2i^"'-"e-A^-"'""^-*'>l/2 

if 

So 

^ p(s-2(1/7r-1/p)) 

inf E||f-f||P>ce s+d-2/,r _ 
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Remark 4. The case p = oo can he treated using the same arguments, without difficul- 
ties. 
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